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THE  PROBLEM  OF  SELECTING  A  GIVEN  NUMBER  OF  REPRESENTATIVE 
POINTS  IN  A  NORMAL  POPULATION  AND  A 
GENERALIZED  MILLS'  RATIO 

By 

Kai-Tai  Fang  and  Shu-Dong  He 


1.  Introduction. 

The  problem  of  selecting  a  given  number  of  representative  points  to 
retain  as  much  information  of  the  population  as  possible  arises  in  many 
situations . 

For  example,  in  order  to  standardize  clothes,  we  take  p  measure¬ 
ments  of  the  body  of  each  of  n  individuals  (in  general,  n  is  sufficiently 
large),  and  project  these  p  dimensional  data  onto  a  q  dimensional 
space  (q  ~  1,2  or  3)  by  principal  components  analysis  of  by  some  other 
method.  We  wish  to  select  m  points  that  best  represent  the  data  in  the 

q-dimensional  space.  In  Fang  (1976),  this  problem  is  analyzed  for  one 
and  two-dimensional  normal  distributions  where  the  intervals  are  of  equal 
lengths  and  the  m  points  are  centered  in  each  interval. 

Bofinger  (1970)  studied  the  question  of  grouping  a  continuous  bi¬ 
variate  distribution  by  intervals  on  the  marginals  thereby  obtaining  a 
discrete  bivariate  distribution.  She  sought  the  grouping  that  would 
provide  the  maximum  possible  correlation  between  these  marginal  variables. 

The  solution  is  approximate  when  both  marginals  are  grouped  and  exact  when 
only  one  margin  is  grouped.  For  the  bivariate  normal  distribution  with 
one  margin  grouped,  tables  of  interval  end  points  that  maximize  the 
correlation  are  provided  up  to  10  intervals. 
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Prior  to  this,  Sitgreaves  (1961)  arrived  at  the  same  bivariate  model 
as  Bofinger  in  connection  with  a  query  about  determining  the  optimal  item 
difficulties  in  a  special  mental  test  design  in  psychometrics.  The  optimal 
representative  points,  for  k  =  2,3,4, 5  intervals  under  a  bivariate  normal 
structure  are  determined  in  that  paper  by  a  graphical  procedure 
thus  producing  the  optimal  item  difficulties  for  a  test  with  k 
items. 

A  paper  by  Max  (1960)  seeks  to  quantize  the  univariate  normal  dis¬ 
tribution  so  that  it  can  be  represented  by  k  points.  Employing  a  mean 
square  error  loss  function.  Max  provides  tables  for  k  =  1,2,..., 36  to 
yield  optimal  representative  points  for  digitizing  the  univariate 
normal.  There  is  a  previous  paper  by  Cox  (1957)  whose  motivation  is 
quantization  and  who  for  the  univariate  normal  provides  a  table  of  optimal 
representative  points  for  k  =  2,..., 6  groups.  In  Anderberg  (1973)  there  is 
discussion  of  sectionalizing  the  univariate  normal  distribution  to  transform 
interval  data  to  ordinal  data  for  clustering  procedures  and  some  abridged 
tables  are  given. 

Max  was  motivated  by  a  signal  processing  problem  and  it  is  in  this 
vein  that  Zador  (1963)  considered  how  to  select  a  random  discrete  vector 
in  one  and  higher  dimensions  to  approximate  a  continuous  variable  employing 
a  mean  square  error  loss  function.  He  provided  a  generalized  model  for  the 
multivariate  normal  distribution  and  secured  bounds  on  quantization  error 
as  a  function  of  dimension  and  moments  of  the  error  term  as  the  number 
of  representative  points  increased.  A  revised  version  of  this  work 
appears  in  Zador  (1982) .  The  tables  produced  by  these  investigators 
and  the  tables  in  this  report  yield  the  same  listings  except  for  compu¬ 
tational  accuracy.  However,  this  report  and  the  paper  by  Max  contain 
the  most  extensive  tables. 
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We  have  been  interested  in  this  subject  since  we  worked  on  the 
standardization  of  clothes.  In  October  1981  we,  just  as  previous 
authors,  obtained  independently  the  results  in  this  paper.  Later 
Professor  T.  W.  Anderson  and  V.  Srinivasan  told  us  of  Cox’s  and  Bofinger's 
work.  Recently,  Professor  H.  Solomon  introduced  Zador,  Max  and  Sitgreave's 
work  to  us.  It  is  a  very  interesting  story  in  the  history  of  Statistics 
that  several  investigators,  motivated  by  different  applications  were  led 
in  the  same  methods,  and  obtained  independently  similar  results  over  a 
period  of  more  than  twenty  years.  Compared  to  other  papers,  ours  gives 
more  theoretical  proof  of  the  computation  of  the  representative  points. 
Perhaps  it  is  valuable  for  people  who  want  to  compute  more  representative 
points  or  to  compute  the  representative  points  in  two  dimensional  space. 

In  addition  we  give  some  basic  results  on  the  generalized  Mills’  ratio 
that  may  be  useful  in  statistical  analysis. 

2 

Suppose  the  distribution  of  the  population  is  N(y,a  ),  where  y 
and  o  are  known.  We  wish  to  determine  m  points  x1,X2»...,xm  that 
are  the  best  representatives  of  the  population. 

What  is  the  meaning  of  "best”?  We  consider  the  loss  function 


(1.1) 


f(x 


1’ 


’Xm} 


rC°  x.-x  2 

min  (— — )  <{>  (x)  dx 

-_co  i<  i<m 


where  cj>(x)  denotes  the  density  of  the  normal  variate  with  mean  y 
2 

and  variance  o  .  Without  loss  of  generality,  we  can  assume  y  =  0, 
and  o  =  1 . 

In  order  to  study  properties  of  the  solutions,  we  generalized  Mills' 
ratio,  and  give  some  basic  properties  of  the  generalized  Mills'  ratio  in 
Section  3.  In  Section  4  we  discuss  properties  of  the  solution  of  some 
equations.  As  a  consequence,  a  computational  procedure  is  suggested  and 
a  table  of  x^,,..,xm  for  m  _<  31  obtained  by  computer  is  given  in 
Section  5. 
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2. 


Preliminaries. 


Rewrite  (1.1)  with  y  =  0,  a  =  1  as 


(2.1) 


f(x1,...,xm)  = 


r(x1-te2)/2/  ^ 


(x~x^)  <j>  (x)  dx 


r(x»+Xn)/2  _ 

(x-x„)  <f>(x)dx  +  •  •  • + 

(xx+x2) /2 


+00 


(x  _+x  )/2 
m-1  m 


(x-x^)  <j>(x)dx  , 


_ig  _nx2 

where  x.  <  •••  <  x  and  4> (x)  =  (27r)  ze  .  In  order  to  find 
1  m 

x. , . . . ,x  ,  we  solve  the  first  derivative  equations: 

1  m 


3f  (x.^  >  •  •  •  ,  x^) 
3x . 


0  j  i  -  1,2, ... , m 


We  obtain  the  following  equations: 


r%(x,+x  ) 

(x-x^)<j>(x)dx  =  0  , 

^  —00 


(2.2) 


/•%(x„+x„) 

(x-x9)<J)(x)dx  =  0  , 

'  %(x^+x2) 


%(x  ,+x  ) 

m-1  m 


(x-x  )<£(x)dx  =  0  . 
m 


Lemma  1.  The  solution  of  the  equations  (2.2)  is  symmetric  about  the 
origin,  i.e.,  x^  =  -x^  i=l*2, . . .  ,m. 
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Proof.  Let  X  ^  N(0,1),  using  the  symmetry  of  the  density  of  X  about 
the  origin  we  have 


E(min(x^-x) ^)  =  E(min(x^+x) =  E(min(-x^-x)^) 


the  lemma  follows.  Q.E.D. 


From  Lemma 


1,  we  only  find  0  <  x^  <  x2  <  * • •  <  if  m  =  2k  is 


even  and  0  =  x_  <  x,  <  • • •  <  x.  if  m  =  2k+l  is  odd.  And  the  loss 
0  1  k 

functions  become  respectively 


^(x.+x.)  7 

(2.3)  f(x  ,  ...,x  )  =  (x-x..)  cj>(x)dx 

l  m  jQ  i 


■*g(x2+Xo)  o  f°°  ? 

(x-x,)  (f)(x)dx+ •  • .  +  (x-x.)  <{>(x)dx  , 

%(x1+x2)  %(Xk_i+Xfc) 


(2.4)  f(xn,...,x  )  =  x^{j>(x)dx 

0  m  J0 


a(x.*o  2  r 

+  (x-x..)  <j)(x)dx  +  •  •  •  + 

>h  xL  J  «3tk-l*,tk> 


(x-xfc)  4>  (x)  dx  . 


Let  3f  (x^, . . .  ,xk)/9xi  =  0  and  3f  (XqjX^^,  . . .  ,xk)/3xi  =  0,  i=l,2,...,k,  and 
we  obtain  the  following  two  systems  of  equations:  if  m  =  2k 
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(2.5) 


<f>(0)  -  <|>(^(x1+x2))  =  x1[$(%(x1+x2))  -  $(0)]  , 

^(%(x1+x2))  -  <J)(^(x2+x3))  =  x2[3>(%(x2+x3))  -  $(%(x1+x2))]  , 

V  •••  •  •  •  ••• 

4>(%(xk_2^k-l))  ~<|)(i5(xk-l+xk))  =  xk-lt$(^(xk-l-tek))-$(3s(xk-l+Xk-2))]’ 

.  ^(^(Xj^.i+Xj^))  =  xkIl-0(J2(xk_1+xk))]  , 


and  if  m  =  2k+l, 


(2.6) 


'<t>  (k  Xx)  -  ^)(%(x1+x2))  =  x1[$(%(x1+x2))  -  $(%  Xx)]  , 
(|)(%(x1+x2))  -  4>(%(x2+x3))  =  x2[$(%(x2+x3))  -  $(%(x1+x2))]  , 


♦<1*(xk-2+,‘k-l))  '  (^(xk-l^))=Vl[$(^(Vl+Xk))-$fi2(xk-2+Xk-l))]  ’ 

(^(Xk-1+Xk^  =  Xk[1“  (is(xk-l+Xk))]  ' 


Where  $(x)  is  the  normal  cumulative  distribution  function  with  mean  0 
and  variance  1.  For  any  0  <_  x  <  y  00  define  a  function 


(2.7) 


M(x,y) 


$(y)  -S(x) 
(x)  -  4>(y) 


When  y  =  00 ,  M(x,=°)  is  the  usual  Mills’  ratio  M(x) ;  thus  we  call 

M(x,y)  the  generalized  Mills’  ratio.  Now  the  systems  of  equations  (2.5) 
and  (2.6)  can  be  rewritten  as  follows  with  xk+3  =  00 : 

(2.8)  xiM[(%(xi_1+xi) ,  Jg^+x^)]  =1,  i  =  l,2,...,k  , 
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where  xQ  =  -x.^  if  m  =  2k  and  xQ  =  0  if  m  =  2k+l.  In  order  to  solve 
the  system  of  equations  (2.8),  it  is  necessary  to  study  some  properties 
of  M(x,y). 


3.  Generalized  Mills'  ratio. 

It  can  be  verified  that  the  generalized  Mills'  ratio  defined  by  (2.2) 
satisfies 


(3.1)  M^(x,y)  =  M(x,y)  =  (1-e  ^ y  X  ^)  1[xM(x,y)-l]  , 

2  2 

(3.2)  M'(x,y)  =  M(x,y)  =  (e^y  x  ^-l)-1  [l-yM(x,y)]  (for  y  <  °°)  . 

y  ay 


In  general,  we  have  by  induction 


(3.3) 


.  k+£ 


— 5“  M(x,y)  =  u  (x,y)M(x,y)-v  0(x,y)  , 

3x3  y 


where  u  (x,y)  and  v,  0(x,y)  are  defined  by  the  following  recursive  formulae: 


r uo,o(x>y)  ’  vo,o(x'y)  "  0 
uk+l,jl<x’y>  '  af  “k,*(x-y)  +*q(*.y)ukit<*,7>  . 


<3. 4) 


vk+l,I’<x’y)  *  q(x’y)\,S.<x,y)  +  Tk,Jl(X,y)  ’ 

\,m(x>y)  =  37  \,JiCx,y)  +  7(HU,7))\|t(*.y)  . 


vk,*+l<x,y)  ■  (1-q(x’y>>ak,it<x’y)  +  wvk,*(x’y)  ’ 


and 


(3.5) 


i  ,  2  2n  , 

q(x,y)  -  (l-e-^(y  -x  ))-1 
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In  particular,  from  (3.4)  we  find  that 


ui,o(x,y)  =  xq^x’y^» 

vx  0(x,y)  =  q^x»y)» 

u0,i(x,y)  =  y (l-q(x,y) ) , 

vo,i(x,y)  =  1-q(x>y)> 

U1  l^x,y^  =  2xyq(x,y)(l-q(x,y)) , 

vi  i^x,y^  =  (x+y)q(x»y) (i-q(x»y)) 

u2  Q(x,y)  =  2x2q2(x,y)+(l-x2)q(x,y) , 

2 

v2,0<x>y>  =  2x9  (x,y)-xq(x,y) , 

u0,2<x>y>  =  2y2q2(x,y)-(3y2+l)q(x,y)+(y2+l) , 

v0,2(x»y)  =  y[l~3q(x,y)+2q2(x,y)]  . 

Consequently,  u^  ^(x,y)  and  vk  ^(x,y)  are  polynomials  in  q(x,y),  i.e 


(3.6) 


k+jj, 

\  o(x’y)  =  I  aJk*A)(x,y)qJ(x,y)  , 
j=0  3 


k+£ 


v  (x,y)  =  l  ba’S')(x,y)q:,(x,y)  , 
jo0  3 


wher  e  a^k’^  (x,y)  and  b^’^^(x,y)  are  polynomials  of  x  and  y  in 
which  the  power  of  x  is  less  or  equal  to  k  and  the  power  of  v  is 
less  or  equal  to  £.  By  using  (3.4)  and  (3.6),  we  can  obtain  the  recur¬ 
sive  formulae  for  a^’^^(x,y)  and  b^k’^(x,y),  too. 

It  is  well  known  that  (cf.  Feller  (1968),  p.  193) 


(3.7) 


\  _  1  1,1.3  1.3.5 

M(X)  •<.  -  -  -J  +  — - 7 

XX  X 


+ 


,  nn  (2n-l) !  I 

v  '  2n+l 
x 


> 


where  (2n-l)!J  =  (2n-l) (2n-3) • • • 3. 1.  Similarly,  by  using  integration  by 
parts  we  can  find 


8 


(3.8) 


(3.9) 


i  2  .  2 

(e-^x  -  <r%y 


f  y  ~t~ 

e  £  dt 


e  *  )  M(x,y)  = 

,  (I  eV.  1  eV,  .  P  1  e-ht 


dt 


x  t 


i  i  2  1  !  2  _  i  2  n 

=  )  .  (-4  e->2*  .  4  e-’S'  )  +  3 

J  -  y 


X 


ry  _1  -*t2 

4  e  ^t 

x  t 


-  l  (-l)j(2j-l)!!(x-(2J+1)e-lsx2-y-(2j+1)e-?sy2) 
j=0 


,,  fy  ^-2k.-%t2 


+  (-l)k+1(2k+l)!I  I  t '“e  '**’  dt 

x 


=  Lk  +  Z2k  <say)  • 


_1^2  _!  2  ^ 

Then  (e  2  -e  ^  )  I  is  an  overestimate  of  M(x,y)  if  k  is  zero 
or  even  and  is  an  underestimate  of  M(x,y)  if  k  is  odd.  The  following 
lemma  is  useful  later. 


Lemma  2.  M’ (x,y)  and  M'(x,y)  are  negative  for  0  <  x  <  y  <  00  and 

X  y 


for  0  x  <  y  <  <»,  respectively. 


Proof.  From  (3.1)  and  (3.2)  we  only  prove  that 


(3.10) 

and 

(3.11) 


x  M(x,y)-1  <  0  for  0  <_  x  <  y  <_  00  , 


1-y  M(x,y)  <  0  for  0  _<  x  <  y  < 


By  using  (3.8)  we  have 
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i  2  i  2  _i  ,2  ry  i  i  *2 

xM(x,y)-l  -  (e-*  -e^  )  1^  (1-5) -x  -j  dt]  . 

y  ''x  t 


For  any  fixed  x  >.  0,  define  a  function  of  y 


g(y) 


=  eva-f)-xf4 

y  ->x  t 


-%t2 

e  dt  ,  y  >  x 


Then  xM(x,y)-l  <0  if  and  only  if  g(y)  <  0  for  y  >  x.  The  latter 

oo  x  -%t2 

is  true  because  g(x+0)  =  0,  g(°°)  =  -x  /  e  dt  <  0  and 


_i  2 

g' (y)  =  -(y-x)e  ^  <  0 


The  inequality  (3.10)  can  be  proved  by  a  similar  method.  Q.E.D. 

In  the  next  section  we  need  the  following  lemma. 

Lemma  3.  For  x  >  0 

4>(x)  (x+2M(x))  <  1  . 

Proof.  Let  f(x)  =  <j>(x)  (x+2M(x) )-l.  Then  the  lemma  follows  the  fact  that 
f(0)  =  0  and  f'(x)  =  -c|)(x)  (1+x2)  <  0. 

4.  Some  Properties  of  the  Equations . 

We  wish  to  give  a  procedure  to  find  the  solutions  of  both  systems 

of  equations  (2.5)  and  (2.6).  The  idea  is  that  given  a  suitable  x-j^  we 

find  X2  from  the  first  equation,  then  for  fixed  x-^  and  X2  obtain 

x^  from  the  second  equation,  based  on  the  X2  and  x^  we  get  x^  from 

the  third  equation,  finally  we  obtain  x^  from  the  last  second  equation. 

* 

In  other  words,  for  fixed  x^  ^  we  can  get  another  solution  x^  from  the 
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ft 

last  equation.  If  the  difference  between  x^  and  x^  is  very  small, 
then  x^,...,x^  are  the  solutions  required;  otherwise  we  modify  x^  and 
repeat  the  above  process.  We  will  prove  the  process  approaches  the 
solution. 

The  equations  in  (2.5)  and  (2.6)  can  be  classified  into  four 
groups.  (They  are  (4.1),  (4.2),  (4.5)  and  (4.7).)  Now  we  discuss 
their  properties  respectively. 

Let  x  ^,...,x  ^  and  xmQ(=0),  x  ^  denote  the  solutions 

of  (2.5)  and  (2.6),  respectively.  When  m  =  2,  there  is  only  one  equation 
in  (2.5),  i.e. 


(f)(0)  =  x1(l-^(0))  -  %x1 


Thus  x21  =  2(f)(0)  =  ilh  =  .797885. 

Theorem  1.  For  any  given  x^  >  0,  the  equation 


(4.1)  x1M(0,ig(x1+x2))  =  1 
or 

(4.1) '  <j)(0)-(f)(^(x1+x2))  =  x1(<K%(x1+x2))-$(0)) 


has  a  unique  solution  x2  =  g2(x^),  say, 
the  condition  is  satisfied  the  function 


if  and  only  if  x^  <  x^.  If 
g2(x^)  is  strictly  increasing. 


Proof.  Let 


From  Lemma  2  we  have 


K  (xi>xo)  =  -%x  M'(o,yj  .  >  0 

x2  1  2  1  y  y=%(x1+x2) 


and 


F(x1,x1)  =  l-x1M(0,x1)  <  0  . 


Thus  for  given  x^  >  0  the  equation  (4.1)  has  a  unique  solution  if  and 
only  if  F(x^,°°)  >0.  As 


F(x^,°°)  =  l~x_^M(0 ,°°)  =  l-x^CO)  =  1—x^  /it/2  . 


FCx^,")  >0  if  and  only  if  x_L  <  j/2/tt  =  x^. 

If  X21  >  xi»  82^x1^  is  strictly  increasing  and  if  dx^dx^ 
”Gi  (x1’X2)/Gx  (*i»x2^  >  °»  where  (cf.  (4.1)’) 

g(x1>x2)  =  tf>(0)-<J)(%(x1+x 2))-x1(^(h(x1+x2))-h)  . 

It  is  easily  shown  that 


and 


Gx  ^X1’X2^  =  ^x2'xi)^^x1+x2))  >  0 


Gx  ^xi,x2^  =  Js(x2-xiH(%(x1+x2))-  (§(k(x1+x2))-h) 


=  H(x1,x2>  , 


say.  From  Lemma  3 
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H(0,x2)  =  \  x2<j>(h  x2)  -  x2)  +  % 


=  hih  x2<t>(k  x2)+2(1-$(J5  x2))-1] 

=  h  <t>(h  x2)lh  x2+2m(%  x2)-i]  <  o  . 

As 

Hx1(x1,X2)  "  -^4>Ci5(x1+x2))(3  +  ^X2“xl))  <  0  > 

h(x1»x2)  <  0  for  0  <_  xx  <  x2,  thus  dx2/dx1  >  0.  Q.E.D. 

As  x2  =  g2(x^)  is  a  monotonic  function,  there  exist  a  =  g2(0+) 
and  b  =  g2(x21).  It  can  be  verified  that  a  =  0  and  b  =  °°,  i.e.,  the 
function  x2  =  g^x^)  is  increasing  from  0  to  00  when  x1  is  from 
0  to  x22* 

Theorem  2.  For  any  given  x^  _>  0,  the  equation 

(4.2)  x2M(%(x1+x2>)  =  1 

or 

(4.2) ’  <K%(x-]+x2))  =  x2(l-$(is(x1+x2))) 

has  a  unique  solution  x2  =  h(x1>  and  Mx^  is  strictly  increasing. 
Proof.  Let 

F(x1»x2)  =  <)>(j5(x1,x2))-x2(1-$(%(x;l+x2)))  ,  x2  >  x1  _>  0  . 

By  (3.9)  we  have 
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FCx^x^  =  <f>(x1)-x1(l-$(x1))  =  <j)(x1)  (l-x^Cx^)  >  0 

and  F(x^,°o)  =  0.  Further  we  have 

F^  (x1,x2)  =  <|)(%(x1+x2))  [%(x2-x1)-M(%(x1+x2))J  . 

As  M(x)  is  strictly  increasing  with  M(0)  =  /ir/2  and  t(x2)  s  %(x2-x^) 

is  a  strictly  increasing  function  of  x2  with  t(0)  =-h  x^,  there  exists 

an  Xq  (depending  on  x^) ‘  such  that  F^  (x^,x2)  <0  if  x2  <  xo> 

F'  (xn ,x  )  >0  if  x9  >  xn.  Combining  the  above  facts  we  obtain 
X^  x.  Z.  Z  U 

Figure  1,  the  graph  of  F(x^,x2)  as  a  function  of  x2* 


Figure  1 

Hence  the  equation  has  a  unique  solution.  Since  F'  (x..  ,x  )  <0  in  the 

X2  ^ 

neighborhood  of  (x^,h(x^))  and 

Fx  ^xi’x2^  “  js(x2~x1)4)(%(x1+x2))  >  0  , 
we  see  dx2/dx^  >0  in  the  neighborhood  of  (x^,h(x^))  for  all  x^  0. 
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Remark  1.  In  particular,  when  x1  =  0  the  equation  (4.2)’  reduces  to 


(4*3)  <£(%  x)  =  x(l-4>(%  x))  . 

It  has  a  unique  solution  and  the  solution  is  1.224014  by  the  bisection 
procedure  or  Newton  procedure.  It  is  the  solution  for  x1  in  the 
case  of  m  =  3,  i.e.,  x31  =  1.224014. 

Remark  2.  It  can  be  verified  that  when  x^  is  from  0  to  00 ,  X2  =  h(x^) 
is  from  x^  to  00 . 

Lemma  4.  If  0  <  y  </(8/3)  log  2 

(4.4)  k  y$(k  y)  <  $(y)  -  $(%  y)  . 

Proof.  Let 

f(y)  =  k  y<t>(k  y)  -  $(y)  +  $(h  y)  . 

As 

l  (y  -I't2 

%  y<^ (y)  < -  I  e  2  dt  <  ky<P(k  y)  , 

/2n  J%y 

we  have 

f  (y)  <  ky  4>(%y  )  -  k  y<Hy)  =  ky  (cK^y)  -  2<J> Cy) ) 

2 

=  ky<f>(ky)  (l-2e  )  <  0  , 

2 

if  y  <  (8/3)  log  2.  Q.E.D. 
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Theorem  3.  For  any  given  x^  >  0,  the  equation 

(4.5)  x1,%(x1+x2>)  =  1 
or 

(4.5) '  <$>(%  x1)  -  <j)(%(x1+x2))  =  x1[$(^(x1+x2))  -$(%  x^)  ] 

has  a  unique  solution  x2  =  t2(x^)  if  and  only  if  x^  <  x^ (=1. 224014) . 
And  t2(x^)  is  a  strictly  increasing  function. 

Proof.  Let 

F(xlSx2)  =  l-XjMC^  x1,%(x1+x2))  . 

From  Lemma  2  we  have 


Fi2<xl,It2)  ' 


-  %  x^  (h 


y=%(xi+x2) 


>  0 


and 


F(x1,x1)  =  l-x^M^  x1,x1)  <  0 


Thus  the  equation  (4.5)  has  a  unique  solution  if  and  only  if  F(x^,°°)  >  0. 
Noting  F(x^,°°)  =  l-x^M(%  x^,°°)  =  1-XjM(%  x^) ,  from  the  property  of  the 
equation  (4.3),  F(x^,co)  >0  if  and  only  if  x^  <  x^  (cf.  Figure  1.). 
In  order  to  prove  t2(x^)  is  monotonic,  we  start  at  (4.5)'.  Let 

G(x1,x2)  =<(>(%  xx)  -  4>(%(x1+x2))  -  x1[$(Jg(x1+x2))  -$(%  xx)]  . 

We  have 
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Gi1<xl,x2) 


kx1  <J>(%x1)  +3s(x2-x1)^)(^(x1+x2))- $(^(x14x2))  +$(%x1)  b  H(x 


say.  Then  dx2/dx^  >0  if  H(x^,x2)  <  0  for  0  <  x^  _<  x2  and  x^  <  x^. 
By  Lemma  4,  for  any  0  <  x^  <  x^(  <  (8/3)ln  2) 


H(x1,x1)  =  3sx1  (j><igx1)  -  [^(Xj^)  -  $(Jsx1>]  <  0  , 

Hx  ^X1’X2^  =  “  jt  4>(is(x1+x2))  (4  +  x2~x^)  <  0  . 

The  theorem  follows.  Q.E.D. 

It  can  be  shown  that  x2  =  t2(x.^)  is  from  zero  to  00  as  x^  goes 
from  zero  to  x^. 

In  the  case  of  m=4(k=2),  the  system  (2.5)  reduces  to 

(4.1) '  j<|>(0)  -  (J)(%(x1+x2))  =  x1[$(%(x1+x2))  -  %]  , 

(4.2) '  I  c|)(}2(x1+x2))  =  x2[l-$(%(x1+x2))]  . 

From  any  given  x^  >  0  we  can  obtain  x2  =  g2(x^)  (if  x^  <  x2^)  from 
(4.1)'  and  x*  =  h(x^)  from  (4.2)*,  respectively.  Figure  2  is  the 
graph  of  x2  and  x*.  By  computation  (in  detail  later),  we  find 
and  x^2 


x41  =  0.452781 


=  1.510437. 


Figure  2 

In  the  case  of  m  =  2k(k>2),  we  need  to  solve  the  system  of  equa¬ 
tions  (2.5).  From  Theorem  1,  for  any  0  <  x-^  <  x^  there  exists  a 
unique  solution  x^  =  g2(x^)  by  the  first  equation  of  (2.5).  We  wish 
to  obtain  x^  =  g^x^  from  the  second  equation  of  (2.5)  based  on  the 
x1  and  x2  =  g^x^. 

Lemma  5.  For  any  given  (x^g^x^),  the  equation 

(4*6)  x2M(Jg(x1+x2) ,  ^(x2+x3))  =  1 

or 

(4.6)’  <j>(%(x1+x2))  -  <|>(%(x2+x3))  =  x2[$(^(x2-hx3))  -  § (%(x1+x2))] 

has  a  unique  solution  x^  =  g3(x^)  if  and  only  if  x^  <  x^. 
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Proof.  Let 


F(x1,x2,x3>  =  1  -  x2MC%(x1+x2) ,  %(x2+x3))  ,  *3  >.  *2  —  xi  * 

By  using  Lemma  2  we  have 


(xrx2 


,x3)  =  -j£  x2M^(ig(x1+x2)  ,y) 


y=*5(x2+x3) 


>  0 


and 

F(xl5x2,x2)  =  1-x2M(%(x1+x2),x2)  <  0  . 

Thus  the  equation  (4.6)  has  a  unique  solution  if  and  only  if  F(x^,x2,oo) >  0. 
As 


F(x1,x2,°°)  =  1  -  x2M(%(x1+x2)> 

it  is  the  function  corresponding  to  the  equation  (4.2).  From  Figure  2 

and  thy  proof  of  Theorem  2,  F(x^,x2,°°)  >0  if  and  only  if  x^  <  x^.  Q.E.D. 

Similarly,  for  given  x^  we  have  in  turn  obtained  x2  =  g2(x^) , . . .  ,x^  =  g^(x^) 
from  the  1st, 2nd, . . . , (i-l)th  equation  of  (2.5),  then  the  equation. 

(4.7)  xiM(^(xi_14xi)  ,  ^(x^taE^))  =  1 

has  a  unique  solution  x^+^  =  ®i+l^xi^  if 'and  only  if  x^  <  x^  A  similar 
conclusion  is  correct  for  system  (2.6). 
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By  computation  we  obtain  Figure  3  and  Figure  4  for  the  case  of 

m  =  2k.  Figure  3  shows  us  that  each  x±  =  s± Cx1>  is  a  strictly  increasing 

function  of  And  Figure  4  expresses  the  relationship  between  x^ 

* 

and  x^  (depending  on  1)  and  between  x^  and  xfc  which  is  obtained 
by  solving  the  last  equation  of  (2.5)  for  the  same  x^_^ . 


0.1  0..2  0.3  0.4  0.5  0.6  0.7  0.8 

Figure  3 


Figure  4 


5»  Computational  Procedure  and  Results. 


Based  on  the  discussion  in  Section  4,  a  computational  procedure  is 
given  as  follows:  If  m  =  2k  do  the  following  steps: 

1)  Take  x ^  =  (k/(k+l))x2^_1^  ^  as  the  starting  point  with 

X21  =  °* 79 7885  and  let  LP  =  0  and  RP  =  x2(k-l)  where  (LP,RP) 
is  the  interval  in  which  x9  will  fall. 

2)  For  given  x^  in  turn  obtaining  x^g^x^  , . . .  >xk=gk(x,) 

from  the  first  (k-1)  equations  of  (2.5).  In  other  words,  for  the 

xk_j_  s°lve  xk  from  the  last  equation  of  (2.5)  and  denote  the  solution 
* 

as  x.  . 
k 

3)  Let  e  be  a  given  small  constant.  (We  take  e  =  10~5.)  There 

are  the  following  possible  situations: 

& 

a)  lxk-xkl  <  £’  c^en  xl,**,’xk  are  taken  as  the  solution  of 


(2.5),  i.e.,  X2k  I  = 
% 

b)  xk  <  xk  +  e  > 
small.  Let  LP  =  x1, 

c>  xk  >  -  e, 

and  return  to  2) . 


Xl”‘”X2k,k  =  V 

Figure  4  shows  that  the  starting  x.^  is  too 
and  x^  =  %  (LP+RP) ,  and  return  to  2) . 
x1  is  too  large.  Let  RP  =  x^  and  x^^  =  %(LP+RP) 


Obviously,  the  above  procedure  converges  to  the  solution,  because 
the  length  of  the  interval  (LP,RP)  reduces  to  half  of  the  original  one 
after  each  repeat.  There  is  a  similar  procedure  for  the  case  of 
m  =  2k+l. 

Tables  1  and  2  list  all  of  x  .  for  m  being  even  and  m  being 

m,j  °  e 

odd,  m  _<  31,  respectively.  If  we  want  to  classify  the  population  into 
m  subclasses,  the  cut-off  points  are  +(x  .+x  ... )/2,  i =  1,2, . . . ,[m/2] -1, 

where  [x]  denotes  the  largest  integer  which  is  less  than  or  equal  to  x. 
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When  m  is  even,  zero  is  a  cut-off  point.  Max  (1960)  listed  (x^ } 
and  cut-off  points  up  to  m  _<  36  but  Tables  1  and  2  have  more  decimal 
places  than  those  given  by  Max.  The  first  column  in  Table  1  and  Table  2 
lists  values  of  (1-loss  functions)%  which  give  us  the  information 
for  determining  the  value  of  m. 

Table  3  and  Table  4  give  probabilities  being  represented  by  {x^ 
for  m  being  even  and  m  being  odd,  respectively.  If  m  =  2k+l, 


xm  q  =  0,  and  we  need  list  it;  but  the  corresponding  probability 
should  be  listed.  Hence  the  forms  of  Table  2  and  Table  4  are  a  little 


different. 


Let  us  look  at  a  use  of  the  Tables.  Suppose  we  want  to  design 

clothes  for  a  group  of  people  and  know  that  height  in  this  group  is 

o 

distributed  according  to  N(y,cr  )  with  y  =  170cm  and  a  =  10cm. 

Let  m  be  the  number  of  sizes  that  we  wish  to  produce,  that  is  we 
wish  to  determine  representative  heights  for  m  models.  If  m  =  7, 
we  find  from  Table  2  that  the  representative  points  are  x^q  =  0, 
x^  =  0.560607,  x^2  =  1.188219  and  x^  =  2.033827.  Therefore,  the 
heights  of  m  =  7  models  are  y  +  x-^a,  i  =  1,2,3,  i.e.  149.7,  158.1, 

164.6,  170.0,  175.6,  181.9  and  190.7.  The  first  column  of  Table  2 
shows  us  that  use  of  these  7  height  categories  instead  of  the  continuum 
represents  a  loss  of  information  equal  to  4.4%  as  measured  by  our  mini¬ 
mized  square  loss  function  (1.1).  If  we  partition  the  group  into  7  sub¬ 
groups,  the  cut-off  points  are  153.9  ( (149 . 7+158 .l)/2) ,  161.4,  167.3, 

172.8,  178.8,  and  186.3.  The  relative  frequencies  associated  with  these  sub¬ 
groups  are  5.36%,  13.74%,  19.87%,  22.08%,  19.87%,  13.74%  and  5. 36%. by  Table  4. 
If  we  assume  m  =  6,  then  from  Table  1  and  Table  3,  the  representative 
points  are  151.1,  160.0,  166.8,  173.2,  180.0  and  188.9,  the  cut-off  points 
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are  155.6,  163.4,  170.0,  176.6  and  184.5,  with  corresponding  relative  fre¬ 
quencies  7.39%,  18.10%,  24.50,  24.50%,  18.01%  and  7.39%;  and  the  loss  of 
information  is  5.8%.  Certainly  for  m  >_  10,  the  increase  in  information 
is  quite  negligible.  It  is  only  for  the  smaller  m  values  that  increasing 
or  decreasing  the  number  of  categories  may  be  significant  in  the  tradeoff 
with  information  gain  or  loss. 
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TABLE  1  X(inij)  <m  -  even) 

ml  1-LOSS  FU  j=l  J=2  j=3  J=4  J=5  J=6  j«=7 


21  63.  661990  0.  797885 
41  88.  252192  0.452781  1.510437 

61  94.204586  0.317726  1.000143  1.893830 

81  96.  549558  0.  245112  0.  756060  1.  344048  2.  152724 

10!  97.712362  0.199659  0.609934  1.057975  1.591756  2.346883 

12!  98.  373484  0.  16B469  0.  511949  0.  876964  1.  286063  1.  783980  2.  501750 

14!  98.  785352  0.  145751  0.  441444  0.  750661  1.  085993  1.  468236  1.  940410  2.  630429 

16!  99.059320  0.128443  0.388197  0.657018  0.942738  1.256885  1.619344  2.072074 

18!  99.  250818  0.  114824  0.  346516  0.  584599  0.  834305  1.  102760  1.  400933  1.  748159 

20!  99.389886  0.103828  0.312991  0.526833  0.749039  0.984354  1.239527  1.525192 

22!  99.494100  0.094761  0.285426  0.479624  0.680056  0.890055  1.114100  1.358719 

24!  99.574232  0.087156  0.262357  0.440290  0.623002  0.812935  1.013227  1.228196 

26!  99.637198  0.080687  0.242766  0.407005  0.574992  0.748576  0.930052  1.122436 

28!  99.687480  0.075117  0.225920  0.378458  0.533995  0-693966  0.860102  1.034604 

30!  99.728346  0.070272  0.211280  0.353707  0.498570  0.647011  0.800374  0.960315 

m!  j=S  j=9  J=10  j=ll  J=12  J=13  J-14  j=*15 


16!  2.  740625 

18!  2.  185671  2.837083 

20!  1.  860332  2.  285642  2.  923058 

22!  1.  634326  1.  959668  2.  375024  3.  000824 

24!  1.  464226  1.  731595  2.  048884  2.  456031  3.  072078 

26!  1.  329943  1.  558865  1.819393  2.  129989  2.  530294  3.  138023 

28!  1.  220415  1.  421689  1.  644658  1.  899454  2.  204461  2.  599053  3.  199659 

30!  1.  128976  1.309258  1.505270  1.723212  1.973188  2.273515  2.663323  3.257769 

TABLE  2  X<m, j)  <m  -  odd  > 

m!  1-LOSS  FU  j=l  j=2  j«=3  j*=4  j=5  J*=6  j=7 


3!  80.982620  1.224014 
5!  92.007208  0.764582  1.724227 

7!  95.603322  0.560607  1.188219  2.033827 

9!  97.219908  0.443675  0.918894  1.476651  2.255903 

11  !  98.  084844  0.  367515  0.  752492  1.  179063  1.  693290  2-  428242 

13!  98.601734  0.313847  0.638403  0.987206  1.381766  1-865845  2.568793 

151  98.935282  0.273945  0.554949  0.851433  1.  175366  1.547025  2.008841  2.687479 

17!  99.163116  0.243098  0.491099  0.749630  1.026111  1.331750  1.686128  2.130809 

19!  99.325586  0.218530  0.440605  0.670188  0.912228  1.173637  1.465193  1.806041 

21!  99.445534  0.198496  0.399639  0.606335  0.822066  1.051424  1.301035  1.581410 

23!  99.  536706  0.  181846  0.  365723  0.  553826  0.  748709  0.  953596  1.  172873  1. 412985 

251  99.607540  0.167788  0.337169  0.509846  0.687751  0.873242  1.069389  1.280453 

27!  99.  663686  0.  155762  0.  312798  0.  472454  0.  636232  0.  805914  O.  983741  1.  172690 

29!  99.708938  0.145357  0.291748  0.440256  0.592077  0.748591  0.911489  1.082946 

31!  99.745920  0.136258  0.273375  0.412228  0.553784  0-699141  0.849610  1.006826 

mi  j=8  j=9  j=10  j=ll  j  =  ‘12  j=  13  j~14  j=15 


17!  2.  790329 

19!  2.  237125  2.  881189 

21!  1.911404  2.331491  2.962832 

23!  1.684276  2.005400  2.416462  3.037181 

25!  1.  512767  1.  776563  2.090355  2.  493931  3.  105654 

27!  1.  376945  1.  602762  1.  860301  2.  167977  2.  565299  3.  169331 

29!  1.265876  1.464410  1.684760  1.937043  2.239604  2.631692  3.229091 

31  !  1.  172913  1.  350754  1.  544445  1.  760177  2.  008046  2.  306347  2.  694096  3.  285904 
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TABLE  3 


m! 

J=0 

4  =  1 

J-2 

4=3 

J=4 

4=5 

J=6 

4=7 

2! 

0.  500000 

41 

0.  336855 

0.  163145 

6 ' 

0.  245031 

0.  181023 

0. 

073946 

81 

0.  191669 

0.  161486 

0. 

106654 

0. 

040192 

ios 

0.  157187 

0.  140660 

0. 

109548 

0. 

068155 

0.  024450 

12! 

0.  133151 

0.  123152 

0. 

103965 

0. 

077349 

0.  046331 
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